Treebank of Chinese Bible Translations

Author

  • Andi Wu
Abstract

This paper reports on a treebanking project in which eight different modern Chinese translations of the Bible are syntactically analyzed. The trees are created through dynamic treebanking, which uses a parser to produce the trees. The trees are manually checked, but corrections are made not by editing the tree files but by regenerating the trees with an updated grammar and dictionary. The accuracy of the treebank is high because the grammar and dictionary are optimized for this specific domain. The tree structures essentially follow the guidelines of the Penn Chinese Treebank. The treebank covers 7,872,420 characters in total. The data has been used in Bible translation and Bible search. It should also prove useful in the computational study of the Chinese language in general.

Similar resources

Incorporating Alternate Translations into English Translation Treebank

New annotation guidelines and new processing methods were developed to accommodate English treebank annotation of a parallel English/Chinese corpus of web data that includes alternate English translations (one fluent, one literal) of expressions that are idiomatic in the Chinese source. In previous machine translation programs, alternate translations of idiomatic expressions had been present in...

Multilingual Aligned Parallel Treebank Corpus Reflecting Contextual Information And Its Applications

This paper describes Japanese-English-Chinese aligned parallel treebank corpora of newspaper articles. They have been constructed by translating each sentence in the Penn Treebank and the Kyoto University text corpus into a corresponding natural sentence in a target language. Each sentence is translated so as to reflect its contextual information and is annotated with morphological and syntacti...

The Bible in Afrikaans: A direct translation – A new type of church Bible

Translating the Bible so that target audiences can easily understand the meaning of the text has dominated the theory and practice of Bible translation since the 1960s. Source oriented translations that are typically associated with word-for-word translations received little theoretical reflection. However, developments in Translation Studies have made it clear that the latter type of translati...

The National Image of the World in an Objective Process of Globalization

The article examines the process of globalization as a continuation of the process of creating Christendom. The author argues that translations of the Bible from Greek or ancient Hebrew into other languages are an example of the first step in the unification of Christendom, and consequently the first step of globalization. The Russian translation of the Bible shows distinctions from other translations, but experimentall...

Creating a massively parallel Bible corpus

We present our ongoing effort to create a massively parallel Bible corpus. While an ever-increasing number of Bible translations is available in electronic form on the internet, there is no large-scale parallel Bible corpus that allows language researchers to easily get access to the texts and their parallel structure for a large variety of different languages. We report on the current status o...

Publication date: 2010